Reliability Modeling of Large Fault-Tolerant Systems
Authors
Abstract
A cluster-based ultra-reliable architecture is presented, offering synchronization and system functionality comparable to that of fully connected systems with reduced system overheads. Existing combinatorial and Markov models do not sufficiently model concurrently occurring faults in such large systems. A reliability model considering the distribution of concurrent faults across the system clusters is shown to increase the accuracy of reliability and system fault tolerance estimates. The hybrid fault model, which classifies faults based on their behavior, further improves reliability estimates and enhances the fault-handling capability of each cluster. Linear growth in cluster reliability with respect to cluster size is possible, as are refinements in the convergence and consistency algorithms for synchronization.
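To illustrate why the distribution of concurrent faults across clusters matters, here is a minimal combinatorial sketch. It is not the paper's model. It assumes nodes fail independently with probability p, that a cluster survives while at most a fixed number of its nodes are faulty, and that the system survives only if every cluster does. The function names and the numbers in the usage note are hypothetical.

```python
from math import comb


def cluster_reliability(n: int, p: float, tolerated: int) -> float:
    """Probability that at most `tolerated` of `n` nodes are faulty.

    Assumes independent node failures with probability `p` (an
    illustrative assumption, not taken from the paper).
    """
    return sum(comb(n, k) * p**k * (1 - p) ** (n - k) for k in range(tolerated + 1))


def system_reliability(clusters: int, n: int, p: float, tolerated: int) -> float:
    """Per-cluster view: every cluster must stay within its own fault budget."""
    return cluster_reliability(n, p, tolerated) ** clusters


def pooled_reliability(clusters: int, n: int, p: float, tolerated: int) -> float:
    """Pooled view: only the total fault count matters, ignoring where the
    faults land. This can never be lower than the per-cluster figure."""
    return cluster_reliability(clusters * n, p, clusters * tolerated)
```

For example, with 3 clusters of 4 nodes, each tolerating 1 fault, and p = 0.01, `system_reliability(3, 4, 0.01, 1)` is lower than `pooled_reliability(3, 4, 0.01, 1)`, because the pooled count also accepts cases where two or more faults fall in the same cluster.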
Similar resources
Mathematical modeling and fuzzy availability analysis for serial processes in the crystallization system of a sugar plant
The binary states, i.e., success or failed state assumptions used in conventional reliability are inappropriate for reliability analysis of complex industrial systems due to lack of sufficient probabilistic information. For large complex systems, the uncertainty of each individual parameter enhances the uncertainty of the system reliability. In this paper, the concept of fuzzy reliability...
Techniques for Modeling the Reliability of Fault-Tolerant Systems With the Markov State-Space Approach
This paper presents a step-by-step tutorial of the methods and the tools that were used for the reliability analysis of fault-tolerant systems. The approach of this paper is the Markov (or semi-Markov) state-space method. The paper is intended for design engineers with a basic understanding of computer architecture and fault tolerance, but little knowledge of reliability modeling. The represent...
Coverage-based testing strategies and reliability modeling for fault-tolerant software systems
Software permeates our modern society, and its complexity and criticality is ever increasing. Thus the capability to tolerate software faults, particularly for critical applications, is evident. While fault-tolerant software is seen as a necessity, it also remains as a controversial technique and there is a lack of conclusive assessment about its effectiveness. This thesis aims at providing a q...
Reliability Growth of Fault-Tolerant Software
Two fault-tolerant software techniques are investigated: recovery block and N-version programming. For each, the stable reliability model is transformed into a model that considers reliability growth via the transformation approach based on the hyperexponential model. Analytic and numeric processing of the transformed models identify the influence of fault removal on the reliability of the faul...
Proceedings of the 2005 International Conference on Simulation and Modeling
Reliability enhancement in software system is a crucial and challenging issue. Applying efficient fault-tolerant mechanism can fulfill the system reliability requirement. This paper proposes reliability models for hierarchical and hybrid fault-tolerant software systems considering failure dependencies or related faults in software components/versions. Our system models are based on the classica...
Reliability and Performance Evaluation of Fault-aware Routing Methods for Network-on-Chip Architectures (RESEARCH NOTE)
Nowadays, faults and failures are increasing especially in complex systems such as Network-on-Chip (NoC) based Systems-on-a-Chip due to the increasing susceptibility and decreasing feature sizes. On the other hand, fault-tolerant routing algorithms have an evident effect on tolerating permanent faults and improving the reliability of a Network-on-Chip based system. This paper presents reliabili...
Journal title:
Volume, issue
Pages -
Publication date: 1992